AITopics | Jeffersontown

Collaborating Authors

Jeffersontown

Confidence Matters: Revisiting Intrinsic Self-Correction Capabilities of Large Language Models

Li, Loka, Chen, Zhenhao, Chen, Guangyi, Zhang, Yixuan, Su, Yusheng, Xing, Eric, Zhang, Kun

arXiv.org Artificial IntelligenceMay-13-2024

The recent success of Large Language Models (LLMs) has catalyzed an increasing interest in their self-correction capabilities. This paper presents a comprehensive investigation into the intrinsic self-correction of LLMs, attempting to address the ongoing debate about its feasibility. Our research has identified an important latent factor - the "confidence" of LLMs - during the self-correction process. Overlooking this factor may cause the models to over-criticize themselves, resulting in unreliable conclusions regarding the efficacy of self-correction. We have experimentally observed that LLMs possess the capability to understand the "confidence" in their own responses. It motivates us to develop an "If-or-Else" (IoE) prompting framework, designed to guide LLMs in assessing their own "confidence", facilitating intrinsic self-corrections. We conduct extensive experiments and demonstrate that our IoE-based Prompt can achieve a consistent improvement regarding the accuracy of self-corrected responses over the initial answers. Our study not only sheds light on the underlying factors affecting self-correction in LLMs, but also introduces a practical framework that utilizes the IoE prompting principle to efficiently improve self-correction capabilities with "confidence". The code is available at https://github.com/MBZUAI-CLeaR/IoE-Prompting.git.

arxiv preprint arxiv, critical prompt, final answer, (14 more...)

arXiv.org Artificial Intelligence

2402.12563

Country:

Europe > Norway (0.14)
North America > United States > California (0.14)
North America > Canada > British Columbia (0.14)
(35 more...)

Genre: Research Report > New Finding (1.00)

Industry: Consumer Products & Services > Restaurants (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback